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@ Reconfigurable ASIC. 

@ A corrfigurable semi-conductor integrated cir- 
cuit with particular application as a re-con- 
figurable application specific device. In order to 
be able to rapidly switch between two or more, 
preferably several, configurations, the inventnn 
pfovides a configurable semi-oonductor inte- 
grated circuit in which an area (1) thereof is 
formed with a plurality of cells (2) each having 
at least one function and interoonnecttons with 
at least some other said cells (2). At least some 
of the plurality of cells have interconnections 
(25) which are electrically selectatile as to their 
conduction state, and at least some of the 
plurality cells have interconnections (YA-YD) 
which are pre-wired. Each ceil has two or more 
possible configurations, each configuration be- 
ing defined by the cell function and/or its inter- 
connection with other celts according to cell 
configuration data, and further comprising 
means (36, 38, 40) storing configuration data for 
at least two ceil configurations (per cell) and 
means (30, 32, 34, 42, 48) to enable one of tiie 
possible cell configurations according to the 
cell configuration data selected. 
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Tile present invention relates to a configurable in* 
tegrated circuit, with particular emphasis on a re-con- 
figurable applicatbn specific device but without linv 
itation to same. 

Micro-processors are designed into many appli- 5 
cations because of their low cost and high perfor- 
mance. However, for many applications such as im- 
age compression and digital signal processing they 
are too slow. Modifications to the basic micro-proces- 
sor architecture has led to several new devices, digi- io 
tal signal processors (DSP), reduced Instruction set 
computers (RISC) and custom processors (CP). Each 
of these devices are optimised to perfbnm a restricted 
number of tasks but at very high speed. IMany appli- 
cations require several types of such devices to ach- is 
ieve the necessary level of perfbnnance. This is be- 
cause of the requirement to perform different types of 
computational tasks over a period of time or the lim- 
ited capability of each device. Essentially these de- 
vices are used as low cost high performance numer- 20 
ical engines, each optimised to implement a general 
class of algorithms. However, a designer frequently 
requites a different architecture to effteientty imple- 
ment a new algorithm and the usual practice in such 
circumstances is to design a custom processorforthis 25 
task. This leads to long and expenshre design cydes 
and does not allow the designer any flexibility to 
change the algorithm. 

Field programmable gate an^ays (FPGAs) are 
commonly used to replace standard products and 30 
they could be used as a numerical engine. However, 
they are general purpose devices that cannot effi- 
ciently implement high speed circuits. In order to ach- 
ieve the level of complexity that Is normally required, 
several FPGAs would be necessary which would in- 35 
crease the cost of the final system. Some FPGAs are 
configured using on chip static random access mem- 
ory (SRAM) and these devices can be re-program- 
med to perform different tasks which could lead to 
greater flexibility and higher levels of performance. 40 
However, these devices are connected to an external 
source of configuration data that is accessed by the 
device to configure internal resources. The time to 
configure or re-configure the FPGA can be several 
milliseconds, due to the necessity to import conf igur- 45 
ation data from an external source, and this time is 
several orders of magnitude too slow. Reconfigura- 
tion speeds of less than 100 nano-seconds are re- 
quired for high performance applicattons. As such 
FPGA's cannot be reconfigured fasi enough to make so 
them suitable for use as a high perfonmance numeri- 
cal engine. In FPGAs a considerable amount of silicon 
area is committed to the configuration menwry which 
is required to program interconnect resources. Whilst 
in theory FPGA's oouid accommodate an addittonal 55 
configuration t>y increasing the amount of on chip 
mentory whteh is available to hold configuration data 
this would probably increase the size of the chip by 60 



per cent which would be prohibitive for high density 
arrays. 

The aim of the invention is to provide a re-config- 
urable architecture which can rapidly switch between 
two or more, preferably several, configurations. An- 
other aim of this invention is to provide a device that 
is specifically optimised to carry out functions for nu- 
merically intensive applications. Another aim is to pro- 
vide a device that prior to the application of power 
contains one or more boot up primary configurations, 
suitable for configuring the device into the intended 
application. Af urther aim is to produce a device that 
has provisfon for passing data between successhm 
conf Igurattons of the (base) device. Astill further aim 
is to ensure that during configuration of the device, 
data Is held in a safe condition and that switching cur- 
rents are minimised. Astill further aim is to provide a 
configuratton cache that will allow updating of conf ig- 
uratfon memories that are not currently in use. An- 
other aim of the invention is to allow the device to se- 
lect its own configuration from an external source of 
oonfiguratk)n data. 

Another aim is to reduce the number of progranv 
mable interoonnectfons by pre-wiring a portion of the 
logic into the required configuration. 

A yet further aim is to increase performance of 
the device by pre-arranging specified primary func- 
tions to specific areas of the device such primary 
functions being substantially pre-wired. 

Accordingly one aspect of the inventton provides 
a configurable semi-conductor integrated circuit in 
which an area thereof is formed with a plurality of cells 
each having at least one function and interconnec- 
tions with at least some other said cells, at least some 
of the plurality of cells having interconnections which 
are electrically selectable as to their conduction state, 
and at least some of the plurality of cells having inter- 
connections which are pre-wired, each cell has two or 
more possible conf igurations, each configuration be- 
ing defined by the cell function and/or its interconnec- 
tion with other cells according to cell configuration 
data, and further comprising means storing conf igur- 
atfon data for at least two celt conf iguratfons (per cell) 
and means to enable one of the possible cell conf ig- 
urations according to the ceD configuration data se- 
lected. 

By pre-wired in relation to interconnect we mean 
uninterruptable as to its oonductton state. The config- 
uration data controls selet^ion of the cell functfon 
and/or cell interconnections preferably using decod- 
ers or alternatively controlled directly from memory. 
Thus for example the cells configuration data deter- 
mines the routing of the signal through the cell. Direct 
connection paths exist between the configuration 
stores, the decoders and the selectable functions and 
interconnections. The term function as used herein 
may be a logic function, arithmetic function, or inter- 
connect function. A cell may have one or more of 
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these functions or a contbination of two or more of 
these. Preferably the configuration data stores are 
disposed in the cell. The desired configuration is se- 
lected using an instruction bus receiving signals from 
a sequencer and controller. One or nrK>re of the con- 5 
figurations may be pre-wired (le. not programmable). 
Advantageously one or more of the configuration data 
stores are progranrvnable using a data transfer bus. 
Where more than one store is programmable an in- 
struction update bus is provided to v/rite enable the 10 
required configuration store. Configuration stores not 
currently accessed to control interconnection and/or 
cell function can be updated using the instruction up- 
date bus. 

Since the present invention \s particular y con- 15 
cerned with an application specific device which is 
optimised to perform a restricted number of tasks at 
high speed but which is quickly reconfigurable during 
program executbn (when required) to perform some 
other specific task, cells are optimised for a primary 20 
function according to a primary configuration. Advan- 
tageously the primary configuration data is pre-wired. 
It Is convenient to have two alternate pre-wlred pri- 
mary configurations. Cells can be and most usually 
will be optimised for different primary functions. Ad- 25 
vantageously the pre-wired interconnections are 
used in connection with the optimised functions. 

A possible primary function is that of an adder. 
Another aspect of the invention provides a multi-bit 
adder for summing at least two multi-bit words com- 30 
prising a f iret multi-bit adder block for summing the 
least significant bits and at least one further multi-bit 
adder block for summing the most significant bits and 
having sum selection means wherein said further 
multi-bit adder block calculates the two possible 35 
sums resulting from a carry out from the previous 
block being equal to 'C and '1' respectively and 
wherein the sum selection means selects the sum of 
the further multi-bit adder block according to the canry 
out calculated from the previous block. 40 

In the case of a Digital Signal Processor applica- 
tion, some cells will be optimised as Arithmetic Logic 
Units (ALU) while other cells may be optimised to car- 
ry out functions such as instruction decode or as 
processor registers. The number of different cells Is 45 
only limited by the size of the array of cells. In practice 
the anay will be divided into a number of discrete 
areas that are particularly efficient at implementing 
respecth^e primary functions. It will be apparent that 
each of these cells has the capability to implement so 
another functton and usually a range of other func- 
tions according to other configurations. These addi- 
tional functions are controlled by the controller and 
sequencer whose role is to ensure that the correct 
function Is available when required. Primary func- 55 
tions may use general Interconnect resources, but 
preferably they have their own dedicated resource for 
high speed connecttons between primary functions 
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of other cells. In this way the performance of the de- 
vice is not dependent on a general programmable in- 
terconnect resource and by connecting primary func- 
tions through resources with smaller parasltk; loads, 
the device can operate faster. 

in order to safeguard data when changing be- 
tween configurations each cell has a latch controlled 
by a function control bit Transient current is reduced 
when switching between configurations by the provi- 
sion of a buffer In each cell, the buffer being control- 
lable as to is state during reconfiguration by a control 
Una. 

It will be apparent that whilst th is device has spe- 
cific applicatton in the field of numerical engines such 
as DSP*s, the primary functions can be chosen to suit 
other appi ications. Accordingly the techniques can be 
applied to any application. For example, another ap- 
pllcatnn is as a programmable communteatlons de- 
vice. 

Another aspect of the invention also provides a 
method of configuring a configurable semi-conductor 
integrated circuit in which a sequence is programmed 
with data to facilitate selectfon of a requM configur- 
ation from at least two possibilities. Usually each of 
a plurality of cells will have at least two conf iguratfon 
possibilities. Advantageously the configurations are 
programmable and the method further comprises in- 
putting and storing configuration data. A further ad- 
vantageous feature is the ability to program the se- 
quences to write over previously stored conf iguratfon 
data at a prescribed point in operation of the circuit 
An aspect of the invention provides a semi-conductor 
integrated circuit in which the circuit configuration is 
changed according to a pre-programmed sequence of 
configuratfon during opmtion of the device. 

The present invention will now be described by 
way of example only with reference to the accompa- 
nying drawings; in which:- 

Figure 1 is a schematic layout for re-configurable 

application specific device embodying the inven- 

tfon; 

Figures 2 and 3 illustrate diagrammatically the 
feature of the core architecture having different 
configurations and sequential access; 
Figure 4 illustrates diagrammatically the feature 
of the core having cells which are optimised to inv 
plement specific functions; 
Figure 5 illustrates diagramniatk:ally a primary 
configuration for the device as a Digital Signal 
Processor (DSP); 

Figure 6 illustrates diagrammatk:ally a second- 
ary configuration for the device as a large multi- 
plier; 

Figure 7 illustrates schenrmtically the layout of a 
cell Including configuration nftemory means; 
Figure 8a illustrates diagrammatically the possi- 
ble arrangement of the cells in blocks v^th opti- 
mised functions; 
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Figure 8b illustrates schematically programma- 
ble local and global interconnect resources for 

the cells; 

Rgures 9a and 9b illustrate diagrammatically 
how the global interconnect resources are con- s 
nected to the cell input and output multiplexers; 
Rgures 9c and 9d Illustrate diagrammatically an 
array of cell blocks and the arrangement of cells 
within a cell blodc; 

Rgure 10 illustrates diagrammaticaOy cell output lo 
state control: 

Rgures 11, 12 and 13 illustrate diagrammatically 
three logic cell variants namely an Arithmetic 
Logic Unit function (ALU), an Accumulator func- 
tion (ACC), and a Decode cell function respec^ is 
lively; 

Rgure 14 Illustrates diagrammatically examples 
of d if ferent functions from the ALU and ACC opti- 
mised core cells; 

Rgure 15 illustrates diagrammatically details of 20 
configurable Static Random Access Memory pro- 
visions; 

Rgure 16 illustrates diagrammatically further de- 
tails of the cell configuration memory; 
Rgure 1 7 illustrates diagrammatically instruction 25 
bus connections for DSP cells; 
Rgure 18 illustrates diagrammatically a novel 
parallel carry select adder architecture which can 
be configured by the device; 
Rgure 19 illustrates a ceil configured to imple- 30 
ment a single stage carry select adder 
F^ure 20 illustrates a cell configured to imple- 
ment two carry select addere; 
Figure 21 illustrates an alternative cell configur- 
ation to implement a single stage carry select ad- 35 
den and 

F^ure 22 illustrates diagrammatically a DSP 
Timing Diagram. 

The present invention is described in the context 
of an Integrated circuit intended for an application 40 
specific device and will be described by way of exam- 
ple in the specific context of a Digital Signal Proces- 
sor (DSP). According to the invention the device is not 
restricted to a fbced architecture, but has the hard- 
ware re-configuraUe to allow the device (eg. DSP) to 4S 
be optimised for each individual task. Thus at a macro 
level the device may be optimised for a new applica- 
tion for example MPEG, Polygon Engine, Blitter, DMA 
Engine, whilst at a micro level, the device can be opti- 
mised for each OPCODE, eg. MULTIPLE ALU, CUS- so 
TOM MULTIPLY Thus a re-configurable application 
specific device (eg. DSP) allows many custom devic- 
es to be replaced with a single chip. Optimised OP- 
CODES increase performance. In effect the device 
can switch at clock speed between operating as a ss 
DSP, RISC or custom processor. 

Referring f Iretly to Figure 1 . here there is illustrat- 
ed a re-configurable application specif ic digital signal 



processor. The chip includes an area 1 of core cells, 
Partitioned Statk: Random Access Memory (SRAM), 
3. a sequencer and controller 5 having control lines 
7. docks 9 and clock lines 11 , as well as programma- 
ble input/output 13 and associated data bus 15. Also 
shown Is a signal Decompress decoder 17, a commu- 
nications link 19 and assodated Input/output and Ex- 
pansion porting 21 , and address bus 23. 

There are a plurality of core cells 2 and these pro- 
vide for example (In the case of a DSP configuration), 
Instruction Decode, registere, programme counter 
and stack pointer fadlltles. Each core cell can be pro- 
grammed to perlbnm a range of functions and certain 
core cells are optimised to implement specific func- 
tions. Thus, for example, reference to Figure 4 illus- 
trates optimisation of certain ceils for ALU functions 
as at 2a, regstere 2b, programme counter 2c, general 
counter 2d, instructton decode 2e and input/output 2f. 

One schematic oonf iguratbn of core cell denoted 
by dotted outline is shown in Figure 7 and the core cell 
indudes within it a logic cell 22 having selectable 
functions (for example four). Programmable core cell 
Inputs (eight) (le. electrically selectable interoonneo- 
tions) are shown at 25 applied to two 4:1 input multi- 
plexere 26,28. The cell output Is shown at 27. Exanv 
ples of Logic cell configurations are described further 
with reference to Figures 11 , 12.1 3, and 14. Input mul- 
tiplexere are controlled by respective 2-4 Decodere 
30, 32. A further 2-4 Decoder 34, controls a 4-1 Mul- 
tiplexer in the logic cell 22 and an output multiplexer 
70 is controlled by a 2-4 Decoder 48. Direct pre-wired 
connections to the logic cell are indicated by numeral 
YA-YD. 

In the Figure 7 Illustration the cell Indudes con- 
figurable memory provisions comprising configura- 
tion cache 36 and instruction cache 38, as well as so 
called "hard wired" or fixed configuration provisk>ns 
40. For the DSP applicatton the fixed configurations 
comprise a primary DSP Boot Configuration set by 3 
X 2 bit conf iguratk>n elements 40a, and a secondary 
configuration eg. Multiplier configuration set by 3 x 2 
bit configuration elements 40b. it is intended that the 
primary (fbced) oonfiguratran will be implemented au- 
tomatically on boot-up of the device so as to give it its 
primary application specific fundion. 

The conf iguratk>n cache 36 in the illustrated em- 
bodiment comprises four, 3 x 2 bit data stores, 36a-d 
which can be write enabled from an instruction up- 
date bus 44 and written with data from Data bus 46. 
The instructton cache 38 comprises 8 x 2 bit data 
stores which are write enabled from the Instrudion 
update bus (44) and written with data from the data 
bus 46. The instruction cache 38 is read enabled from 
the Instrudion seled bus 42. A 2-4 Decoder 48 en- 
abled from the instructton select bus 42 selects and 
read enables one of the four data stores 36a-d ac- 
cording to the data store of the instrudion cache se- 
lected. The output of Decoder 48 also fadlitates the 
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direct configuration of the logic ceil t>y controlling the 
4:1 output multiplexer 70. Also illustrated is a function 
control bit 50 and has connections from the read and 
write enable lines (42,44} and into the logic cell 22. 
The f unction control bit 50 controls latch 54 (see Fig- s 
ure 10). 

Rgure 16 illustrates, for the fixed configuration 
provisions (40) and the configuration cache 36, the 
read (42). write 44' and data 46' connections. Note 
both read and write provisions for the configuration lo 
cache 36 only. 

Reverting bacic to Figures 2 and 3, each of blocks 
2\ T and 2"' represent configurations of the core 2. 
Large blocks of f unctkinality are accessed as a series 
of configurations. Each new configuration receives is 
data from the last using inter-process connections 52 
and ceils 54 designated for latching critical data. 
Other cells 54 are designated to act as Inputs or out- 
puts. Reconfiguration time can be of the order of 
1 0nsec The core architecture is optimised to imple- 70 
ment each OPCODE. This allows the word size of 
each arithmetic function to be adjusted to the required 
provision. Thus, referring to Figure 3, a first core con- 
f iguratk)n (OPCODE 1) executes a 16 bit multiply and 
cos function, a second core configuration (OPCODE 25 
2) carries out a 32 x 32 bit multiply function, and a 
third configuration (OPCODE 3) carries out a 64 bit 
ADD function. 

Reference is now made to Figure 10 which illus- 
trates the output state control as applicable to the like 30 
of the ceil illustrated in Figure 7 and the correspond- 
ing cell components appropriately referenced are illu- 
strated with the exception of the instructton cache 38. 

As has been mentioned above certain cells are 
designated for latching critical data and hence the 35 
cells have a latch provision 54 with inputs from the 
funGtk>n control bit 50 and a hold input line 56. These 
function to preserve the state of data from cells be- 
tween configurations. In addition a buffer 60 is pro- 
vided in order to reduce transient current when 40 
switching between configurations by setting its out- 
put state to a known condition. 

The cells interconnect resources are now descri- 
bed with reference to Figures 8a. 8b, 9a, and 9b. Fig- 
ures 8a and 8b show diagrammatically how cells 45 
might be arranged in regular blocks (B) (eg. rows and 
columns), with the blocks Including cells which are 
optimised for different functions. Thus Figure 8b 
shows columns of ACC cells, ALU cells and shift ceils, 
and two rows of Decode ceils. Columns of cells each 50 

have two global (Y) buses (Y1, Y2, Y3. Y4 YN1. 

YN) and the rows of cells each have at least two glo- 
bal (X) buses (X1. X2 JCn-1. Xn). The Decode cells 

head up the columns of each block and have three X 
buses. Bus switches BS are provided in the Y buses 55 
between adjacent blocks. In addition there are hUden 
(or pre-wired direct connectton) Y buses, YA-YD. 
These run from the decode cells to all the cells in the 



column below. In addition local direct connection 
paths are preferred between cells. Thus, taking as an 
example cell SC in Figure 8b, it has input connections 
from outputs of an upper adjacent cell, a lower adja- 
cent cell, a right adjacent cell, a left adjacent cell, and 
a next left adjacent cell. These connections are des- 
ignated U, D, R, L, J. Not all cell variattons will nec- 
essarily have all the local connections. The majority 
of these k>cal connectnns are electrically selectable 
as to their conduction state, but nwst usually the left 
adjacent connection will be a pre-wired connection. 

Figure 9a Illustrates, for one cell as for all core 
cells, how an input multiplexer 26 controls selectkm 
of inputs from X and Y buses and an output multiplex- 
er 70 controls selection of outputs to the sanne X bus- 
es and next column of Y buses. 

The cells are arranged In 1 0 x 8 blocks and an ex- 
ample of such an array of cell blocks Is illustrated in 
Figure 9c. Blocks 1 00 are formed in an 8 x 4 an^y and 
a programmable input/output 102, data buses and 
switches 104 and partitioned SRAM 106 are also 
shown. Each block 100 comprises an array of 10 x 8 
cells and conveniently, columns of cells within the 
block have a similar primary conf iguratton. Forexanv 
ple. Figure 9d illustrates a block 100 having two col- 
umns of cells 100 a & b configured as multiplexer 
cells, columns 100 c as a product adder, lOOd barrel 
shifter cells, 1 00 e arthimetic and logic cells, 1 00 f ac- 
cumulator ceils and columns 100 g & h configured as 
multiplier expansion cells. The columns in each block 
are headed up by decode cells. 

Refenring now to Figure 15, the configurable sta- 
tic random access memory (SRAM) 3 stores partition 
data passed to it from the sequencer and controllers 
along partition data bus 72. The operation of the DSP 
requires the storing and retrieving of data and the pro- 
vision of the SRAM on the device ensures that access 
to the stored data is faster than if the SRAM was lo- 
cated externally. 

The sequencer and controller 5 controls the op- 
eratnn of buses 42. 44, 45 and 46. Hence, the se- 
quencer and controller 5 includes the control of the 
operation of selecting individual data stores of cells, 
sending data to the stores and controlling the se- 
quence of implementation of conf iguratbn data stor- 
ed within cell. The necessary control instructfons for 
the sequencer and controller 5 is provided by an ex- 
ternal source of memory (not shown). In addttion to 
the above operattons, the controller 5 can select in- 
dividual data stores not currently used such that they 
can be updated with new configurations from the ex- 
ternal memory. 

Figures 11, 12 and 13 illustrate respective ALU, 
ACC and Decode cell variants. Appropriate referenc- 
es have been used as previously referred to. 

Figure 13 shows an example of a cell optimised 
for decode. Two decode cells will head up the blocks 
of cells as shown in Figures 8a and 8b. The illustrated 
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variation is the one which has the pre-wired intercon- 
nection YA, YB which feed down to each of the cells 
below. The other decode will generate the YC, YD 
pre-wired interconnections. Thus the ALU type cells 
of Figure 11 have pre-wired connections YA, YB. 5 
whilst the ACC type cells have pre-wired connections 
YA. YB, YC, YD. Note also that for the ALU and ACC 
variants the left adjacent connection L Is pre-wnred, 
and for the ALU cell the Cin, Gout is a pre-wired inter- 
connection running the length of the oolumn of cells. io 
Other X and Y buses are as described above. 

Control signals from the outputs of the decode 
and for inputs of the cell variants will be pre-wired for 
the optimised cell functions, ie. for any functions 
which are Icnown to be needed for the specific appli- is 
cation. 

Figure 14 Illustrates some of the different func- 
tions which are available from the ACC and ALU core 
cells of Figures 10 and 11 respectively. 

Figure 17 illustrates an alternative internal cell 20 
arrangement for the case of DSP cells (shown simpli- 
fied) with the cell input shown simply at 25 and cell 
output at 27. The memory comprises 8 x 3 bit data 
stones and a 3-8 Decoder 80 is provided such that one 
of the eight selectable options (eg. functions or inter- 25 
connect) contained in the logic cell can be selected. 
In order to update a particular data store within a par- 
ticular cell there is provided a memory select 45 
(omitted from the illustrations of the previously de- 
scribed cell arrangement) and hence the required cell 30 
can be selected and the particular data store to be 
write enabled or read enabled is selected by the in- 
struction update bus (44) or instruction bus (42). Data 
is written to the data store from menfKiry data bus (46) 
(not illustrated in Figure 17). 35 

A novel adder structure which can be configured 
by the device will now be described with reference to 
Figures 18 to 21 . A 16-bit adder is illustrated in Rgure 
18 and indicated generally by numeral 60. The adder 
comprises a plurality of carry select adders 62 form- 40 
Ing a first multi-bit adder blodc 64 and a second multi- 
bit adder block 66. The adder 60 sums two 16 bit 

words indicated as a1. a2. a3 a16 and b1. b2, 

b3 b16 in order to derive a sum indicated by si. s2, 

s3 s16 and carry element 'Cout*. 45 

Rrst multi-bit adder block 64 sums the eight least 
significant bits of each 16 bit word and for each bit 
there is an associated carry select adder 62. Each 
carry select adder comprises two inputs An. Bn 
(wherein 'n' is the number of the bit), output 68. carry so 
in 70. carry out 72 and a first and second 2:1 multi- 
plexer 74, 76. The first input to the first multiplexer 74 
is equal to the value of An + Bn assuming the carry 
in is '0' and the second input assumes that carry in to 
be 'V. The output Sn is selected by the carry in 70. ss 

The two Inputs to the second multiplexer 76 are 
equal to the carry resulting from the sum of An and 
Bn with the carry in being equal to '0* and '1 '. The car- 



ry out 72 is selected by carry in 70. Obviously, the car- 
ry in to the first carry select adder will be equal to '0*. 

The second multi-bit adder block 66 sums the 
eight most significant bits of each 16 bit word and for 
each bit there are two associated carry select adders. 
78. 80. Each of the canry select adders 78, 80 is con- 
structed In a similar manner as described above. Car- 
ry select adders 78 sum the two eight bit words le^ 89, 

aiO ai6 and bg. biO bi6, assuming that the carry 

out from the first adder block 64 is 'V and carry select 
adders 80 assume that the carry out is '0'. Therefore, 
for each bit two outputs are calculated and fed Into an 
associated multiplexer 82. The output provkJing Sn is 
selected by the canry out from the first adder block 64. 

In operation, the first adder block calculates the 
addition of the eight least significant bits and produc- 
es a carry out value. Simultaneously, the second ad- 
der block calculates the two possible sums of the ad- 
ditbn of the nnost significant bits and the correct sum 
is selected by the carry out produced by adder block 
64. In consequence the time delay to calculate a 16 
bit addition is taken to be the delay in the addition of 
the first eight bits (8ADD) plus the delay In selecting 
the sum of the last eight bit ie. one multiplexer delay 
(MUX). 

For each additional eight bit adder block the time 
delay is equal to one multiplexer. Forexample. a thirty 
two bit adder would result in a propagation delay of 
8ADD + 3 X MUX. In consequence, the adder struc- 
ture described results in an improved speed of oper- 
atk>n compared to that of a conventtonal adder struc- 
ture. 

Figure 20 illustrates an alternative cell structure 
wherein the two carry select adder requiring two cells 
can be replaced by a single configured cell. 

Figure 21 illustrates a conventkinal circuitfbr sin> 
gle stage carry select adder whteh may be used as an 
alternative to the circuit of Figure 19. 

The operation of the device will now be described 
wherein initially, as described above, the configura- 
tion provisions 40 are 'hard wired' or f bced with a DSP 
configuration 40a and a multipler configuration 40b. 

An external memory store (not shown) oonteins 
all the necessary configuration date in order to con- 
trol the controller and sequencersuch that each of the 
data stores (36a-d, 38) in each cell can be progranv 
med. In order to program a date store a typical proce- 
dure would be to firstly select the ceD by memory se- 
lect 45. select the date store to be write enabled by in- 
structton update bus 44 and to write date to the se- 
lected store via date bus 46. 

Each of the four data stores of the configuration 
cache 36 conteins suff k:ient configuration date to se- 
lect the input to the iogk; cell 22 and to also select one 
of the functions contelned within the logic cell. 

The initial boot up operatton of the device results 
in a configuration as per either of the primary oonf ig- 
uratkkns 40a. 40b according to the boot up instruo- 
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tion. Thus for example the DSP or Multiplier configur- 
ation is estabiished. 

However, if the device is required to implement 
another configuration eg. a divide function, then the 
controller and sequencer 5 selects and write enables s 
the required data store of the configuration cache 36 
of each cell necessary to implement the configura- 
tion. The external memory supplies the necessary 
data as to which cell and data stores are to t)e select- 
ed in order to implement the required configuration. io 

There is also the option for adopting other pro- 
grammed configurations from the configuration 
cache and for writing and sut)Stituting other configur- 
ations. 

Thus for the example given, the four configura- is 
tlons possible from the configuration cache may not 
be sufficient. Software programming can be used to 
implement another configuration. The progranvner 
will be able to refer to the technical specifications for 
the device and determine how the desired func- 20 
tron/conflguretion can be implemented (for example 
many possible architecture changes will be listed, 
perhaps in terms of a load instruction). T^us whilst 
load instructions 1-4 might represent the most typical 
configurations which are to be stored in the configur- 25 
ation cache, the programmer determines from the 
technical specification that load instruction 33 for ex- 
ample is required. Thus the programmer will have the 
Instruction loaded into the configuration cache. There 
will be instances where more configurations are re- 30 
quired to process the incoming data then can be stor- 
ed in the cell memory for access at dock speed. How^ 
ever, this difficulty can be overcome by re-progranv 
ming a "redundant" configuration cache with the "ad- 
ditional" configuration data in advance of its require- 35 
ment, by including the re-configuration Instruction in 
the software programme. The sequencer can control 
re-configuration at dock speed, whilst the data from 
the configuration is held safe in the latch cells. The 
four configurations (36a-36d) of the cache can be re- 40 
used in different combinations at different cell sites. 
Ttiis is fadlitated by instruction cache (38) which can 
select different local cell configurations from a global 
instruction placed on instruction bus 42. 



Claims 

1. A configurable semi-conductor integrated circuit 
in which an area (1) thereof is formed with a plur- 50 
ality of cells (2) each having at least one function 
and inter-connections with at least some other 
said cells (2), characterised in that at least some 
of the plurality of cells (2) have interconnections 
(25) which are electricaliy selectable as to their 55 
conduction state, and at least some of the plural- 
ity of ceils (2) have interconnectbns (YA-YD) 
which are pre-wired, each cell has two or more 



possible configurations, each configuration be- 
ing defined by the cell function and/or its inter- 
connection with other cells according to cell con- 
figuration data, and further comprising means 
(36. 38, 40) storing configuration data for at least 
two cell configurations (per cell) and means (30, 
32, 34, 42, 48) to enable one of the possible cell 
configurations according to the cell configuration 
data selected. 

2. A configurable semi-conductor integrated circuit 
as dalmed in dalm 1 in which means (36, 38, 40) 
storing at least two cell configurations are pres- 
ent in the cell. 

3. An integrated circuit as daimed in daim 1 or 2 in 
which the means for selecting the required cell 
configuration comprises an instruction bus (42) 
communicating with the said configuration data 
store. 

4. An integrated circuit as daimed In daim 1 . 2 or 3 
in which at least one of the cell configurations Is 
pre-wired (40a, 40b) to configure the integrated 
circuit with an application specific function when 
selected. 

5. An integrated circuit as daimed in daim 4 in 
which there are two pre-wired (40a, 40b) applica- 
tion specific functions. 

6. An integrated drcuit as daimed in any one of the 
preceding daims In which there is at least one 
programmable cell configuration. 

7. An integrated circuit as daimed in any one of the 
preceding daims in which there are both pre-wi- 
red and programmable cell configurations. 

8. An integrated drcuit as daimed in daim 6 or 7 fur- 
ther comprising a write enable bus (44), and a 
data bus (46) communicating with the means (36, 
38) storing the cell configuration data for the pur- 
pose of rewriting data to the store for re-progranv 
ming purposes. 

9. An integrated drcuit as daimed in any one of the 
preceding daims further comprising means stor- 
ing a plurality of configuration selection instru> 
tlons, an instructions select bus (42) communi- 
cating with said means and an output signal path 
for seleding the required configuration data store 
to be implemented or directly effecting cell con- 
figuration. 

10. An integrated drcuit as claimed in daim 9 further 
comprising an instruction write bus (44) and an 
instruction data bus (46) for writing to the instruo- 
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tion storing means (36. 38). 

11. An integrated circuit as claimed in any one of the 
preceding claims in which means (54) is provided 

to preserve the output t>etween configurations. s 

12. An integrated circuity as claimed In dalm 11 In 
which said means comprises a latch (54) wherein 
each cell incorporates a latch to preserve its out- 
put 10 

13. An Integrated circuit as claimed in any one of the 
preceding claims in which the cells are optimised 
for a primary function. 

14. An integrated circuit as claimed in claim 13 com- 
prising cells which are optimised for different pri- 
mary functions. 

15. An Integrated circuit as claimed in any one of the 20 
preceding claims including means (60} to reduce 
transient current when switching between config- 
urations. 

16. An integrated circuit as dainted in daim 15 in 25 
which said means comprises a controIlat>le buffer 
(60) in the output line of each cell. 

17. An integrated circuit as claimed in any one of the 
preceding daims further comprising sequencer 30 
means (5) to control the availability and selection 

of the configuration. 

18. An integrated circuit as claimed In any one of the 
preceding daims comprising decode means (30, 35 
32, 34, 48) in each cell (2) to decode configura* 

tion state to contrd the configuration of each cell. 

19. An integrated circuit as daimed in any one of 
daims 4 or 5, 13 or 14 in which the configuration 40 
data store corresponding to the primary or appli- 
cation spedfic function of the cell is contained 
within the device in a non-volatile memory. 

20. An integrated circuit as daimed in any one of 45 
daims 13 or 14 in which the pre-wired (hidden) in- 
terconnect resources interconnect optimised 
cells for eff ident implementation of the primary 
(application specific) functions. 

so 

21. A multi-bit adder for summing at least two multi- 
bit words comprising a first multi-bit adder blodc 
(64) for summing the least significant bits and at 
least one further multi-bit adder block (66) for 
sunwning the most significant bits and having 55 
sum selection means wherein said further multi- 
bit adder block calculates the two possible sums 
resulting from a carry out from the previous block 



being equal to '0' and '1* respectively and 
wherein the sum selectton means selects the 
sum of the further multi-bit adder block according 
to the carry out calculated from ttie previous 
bk)dc 

22. A method of configuring a corrflgurable semi- 
conductor integrated drcuit having a plurality of 
cells (2) with at least two configuration possibili- 
ties in which a sequencer (5) is programmed with 
data to fadlltate selection of the required cell con- 
figuratton. 



24. A method as daimed in daim 22 or 23 further 
comprising programming the sequencer with 
data to write over previously stored configuration 
data at a prescribed point in operation of the cir- 
cuit 

25. A configurable semi-conductor integrated drcuit 
characterised in that circuit configuration is 
changed according to a pre-programmed se- 
quence of conf igurattons during operatk>n of the 
devtee. 

26. An integrated circuit as daimed in daim 25 in 
which an area there is formed with a plurality of 
cells, each cell having two or more possible con- 
figurations, each configuration being defined by 
the cell function and for its interconnection with 
other cells according to configuration data. 



23. A method as daimed in daim 22 further compris- 
IS ing inputting and storing cell configuratton data. 
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